Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform

نویسندگان

  • Ali Mosleh
  • Nizar Bouguila
  • A. Ben Hamza
چکیده

A slew of semantic image content analysis techniques are specialized in extracting text embedded in images since it is a vital source of semantic information. A robust text detection step is the basic requirement for a scheme designed to extract text information from images. Text detection is still a challenging issue due to unconstrained color, sizes, alignments of characters, lighting and also various shapes of fonts, even though various methods have been proposed in the past years [2]. Existing text detectors can be broadly classified into two main groups: texture (also called region) based and connected component (CC) based methods. The general scheme of our proposed method consists in producing (Fig. 1) the image edge map and then finding CCs based on stroke width transform (SWT) [1] guided by the generated edge map. Next, precise feature vectors are formed using the properties of CCs from SWT and pixel domain. An unsupervised clustering is performed on the image CCs to detect the candidate text CCs. Finally, text candidate components are linked to form text-words. The method is considered as a CC-based technique and the contribution is twofold: 1) Since accurate edge maps drastically enhance SWT results, a precise edge detection approach adaptive to text-regions is proposed by employing the bandlet transform. 2) A feature vector based on text properties and stroke width values is employed in k-means clustering in order to detect text CCs. Bandlet transform [3] effectively represents the geometry of an image. The image coefficients are dyadically segmented in squares S for polynomial flow approximation of the geometry before the bandletization process. Since the image coefficients are all warped along local dominant flows in the bandlet transform, the final bandlet coefficients generated for each segmentation square S have the form of approximation, and highpass filtering values appear in the wavelet transform of a 1D signal. We benefit from the bandlet-based resulting 1D high-pass frequency coefficients that are adapted to the directionality of the edge that exists in each segmentation square S in order to find a binary map of the edge positions in the image. Since the approximation part of the bandlet transform resulting coefficients consists of coarse information of the original signal, we discard it and only process the high-pass coefficients. The first-order derivatives of the fine-detail bandlet coefficients are computed. Then, local maxima of the resulting gradient signal are found using a contextual filter as follows:

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

MOSLEH ET AL.: IMAGE TEXT DETECTION USING A NOVEL EDGE DETECTOR AND SWT1 Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform

In this paper, we propose a text detection method based on a feature vector generated from connected components produced via the stroke width transform. Several properties, such as variant directionality of gradient of text edges, high contrast with background, and geometric properties of text components jointly with the properties found by the stroke width transform are considered in the forma...

متن کامل

Directional Stroke Width Transform to Separate Text and Graphics in City Maps

One of the complex documents in the real world is city maps. In these kinds of maps, text labels overlap by graphics with having a variety of fonts and styles in different orientations. Usually, text and graphic colour is not predefined due to various map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...

متن کامل

آشکارسازی و تعیین مکان متون فارسی - عربی در تصاویر ویدیویی

Video text detection plays an important role in applications such as semantic-based video analysis, text information retrieval, archiving and so on. In this paper, we propose a Farsi/Arabic text detection approach. First, with an appropriate edge detector, edges are extracted and then by using edges cross ponts, artificial corners are extracted. Artificial corner histogram analysis is done for ...

متن کامل

Egyptian License plate Recognition using Enhanced Stroke Width Transformation and fuzzy ARTMAP

License Plate Recognition (LPR) is the most important type of Intelligent Transportation System (ITS). LPR is used in many different types of ITS like electronic payment systems, toll station, parking fees, freeway and arterial management systems for traffic surveillance. Few years ago, Egyptians government changed the car license plate to include letters and numbers. So the needs for efficient...

متن کامل

Text Extraction in Natural Scenes using Region-based Method

Text in images is a very important clue for image indexing and retrieving. Unfortunately, it is a challenging work to accurately and robustly extract text from a complex background image. In this paper, a novel region-based text extraction method is proposed. In doing so, the candidate text regions are detected by 8-connected objects detection algorithm based on the edge image. Then the non-tex...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012